UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation
Abstract
This study describes the systems submitted by the Center for Robust Speech Systems (CRSS) at the University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed four i-vector speaker recognition systems, based on UBM and DNN front ends, with alternate data sets and feature representations. The emphasis of NIST SRE 2016 is on language mismatch between the training data and the enrollment/test data, so-called domain mismatch. Our system development therefore focused on: (i) using unlabeled in-domain data to center the i-vectors and alleviate the domain mismatch; (ii) selecting appropriate data sets and optimizing configurations for LDA/PLDA training; (iii) introducing a newly proposed dimension reduction technique that incorporates unlabeled in-domain data before PLDA training; (iv) unsupervised speaker clustering of the unlabeled data, which is then used alone or together with data from previous SREs for PLDA training; and (v) score calibration using unlabeled data with “pseudo” speaker labels generated by speaker clustering. The NIST evaluation results show that the proposed methods were effective for this task.
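Point (i) can be illustrated with a minimal sketch. It assumes that centering means subtracting the mean of the unlabeled in-domain i-vectors from every enrollment and test i-vector. The length normalization step, the function names, and the toy 3-dimensional vectors are illustrative assumptions and are not taken from the described systems.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    n = len(vectors)
    return [sum(v[d] for v in vectors) / n for d in range(dim)]

def center_and_normalize(ivector, in_domain_mean):
    """Subtract the in-domain mean, then scale to unit length.

    Length normalization is an assumed extra step, commonly applied
    before PLDA; the abstract itself only mentions centering.
    """
    centered = [x - m for x, m in zip(ivector, in_domain_mean)]
    norm = math.sqrt(sum(x * x for x in centered))
    if norm == 0.0:
        return centered  # a vector equal to the mean stays at the origin
    return [x / norm for x in centered]

# Toy 3-dimensional vectors standing in for unlabeled in-domain i-vectors.
unlabeled_in_domain = [
    [1.0, 2.0, 3.0],
    [3.0, 2.0, 1.0],
    [2.0, 5.0, 2.0],
]
mu = mean_vector(unlabeled_in_domain)

# A hypothetical enrollment i-vector, centered with the in-domain mean.
enroll = center_and_normalize([5.0, 7.0, 2.0], mu)
```

The mean is estimated from the unlabeled in-domain data rather than from the out-of-domain training data, so the enrollment and test i-vectors end up centered around the origin of their own domain. No speaker labels are needed for this step.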
Related resources
UTD-CRSS Systems for 2012 NIST Speaker Recognition Evaluation The CRSS SRE Team
This document briefly describes the systems submitted by the Center for Robust Speech Systems (CRSS) from The University of Texas at Dallas (UTD) for the 2012 NIST Speaker Recognition Evaluation. We developed a state-of-the-art i-vector based speaker recognition system [1]. Probabilistic linear discriminant analysis (PLDA) [2] along with several other backends are used for channel/noise compens...
The CRSS systems for the 2010 NIST speaker recognition evaluation
This document briefly describes the systems submitted by the Center for Robust Speech Systems (CRSS) from The University of Texas at Dallas (UTD) in the 2010 NIST Speaker Recognition Evaluation. Our systems primarily use factor analysis as feature extractor [1] and support vector machine (SVM) classification framework. Our main focus in the evaluation is on the telephone trials in the core cond...
UTD-CRSS Systems for NIST Language Recognition Evaluation
This study summarizes the overall solution and sub-systems developed by the Center for Robust Speech Systems (CRSS) at the University of Texas at Dallas to address the NIST LRE-2011 competition. CRSS-UTD employs five core sub-systems in the proposed language ID solution that include: (1) i-vector, (2) SVM-GSV, (3) PPRLM, (4) Articulatory Feature based, and (5) Prosody based. The first four repr...
SUT Submission for NIST 2016 Speaker Recognition Evaluation: Description and Analysis
In this paper, the most recent Sharif University of Technology (SUT) speaker recognition system developed for NIST 2016 Speaker Recognition Evaluation (SRE) is described. The major challenge in this evaluation is the language mismatch between training and evaluation data. The submission is related to the fixed condition of NIST SRE 2016 and features a full description of the database and the sy...
The 2012 NIST speaker recognition evaluation
In 2012 NIST held the latest in an ongoing series of text-independent speaker recognition evaluations (SREs). The 2012 NIST Speaker Recognition Evaluation (SRE12) was the largest and most complex SRE to date, including over 100 million trials. Several aspects of SRE12 were new; most significantly, NIST released in advance of the evaluation target speaker training data from six preexisting corpo...
Publication date: 2017